Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 296 |
| Missing cells | 407 |
| Missing cells (%) | 5.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 75.3 KiB |
| Average record size in memory | 260.4 B |
Variable types
| NUM | 15 |
|---|---|
| BOOL | 8 |
| CAT | 3 |
| DATE | 1 |
Reproduction
| Analysis started | 2020-05-05 17:13:56.645330 |
|---|---|
| Analysis finished | 2020-05-05 17:14:31.800674 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
month is highly correlated with quarter and 1 other fields | High Correlation |
quarter is highly correlated with month and 1 other fields | High Correlation |
weekofyear is highly correlated with quarter and 1 other fields | High Correlation |
meanwd_udsprevisionempresa is highly correlated with meanwd_udsventa | High Correlation |
meanwd_udsventa is highly correlated with meanwd_udsprevisionempresa | High Correlation |
udsstock has 97 (32.8%) missing values | Missing |
udsventa has 61 (20.6%) missing values | Missing |
udsprevisionempresa has 79 (26.7%) missing values | Missing |
roll4wd_udsventa has 50 (16.9%) missing values | Missing |
meanwd_udsventa has 42 (14.2%) missing values | Missing |
roll4wd_udsstock has 18 (6.1%) missing values | Missing |
roll4wd_udsprevisionempresa has 60 (20.3%) missing values | Missing |
weekday has 42 (14.2%) zeros | Zeros |
sin_weekday has 42 (14.2%) zeros | Zeros |
roll4wd_udsprevisionempresa has 5 (1.7%) zeros | Zeros |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10641.0 |
|---|---|
| Minimum | 21 |
| Maximum | 21261 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 1083 |
| Q1 | 5331 |
| median | 10641 |
| Q3 | 15951 |
| 95-th percentile | 20199 |
| Maximum | 21261 |
| Range | 21240 |
| Interquartile range (IQR) | 10620 |
Descriptive statistics
| Standard deviation | 6162.628011 |
|---|---|
| Coefficient of variation (CV) | 0.5791399315 |
| Kurtosis | -1.2 |
| Mean | 10641 |
| Median Absolute Deviation (MAD) | 5328 |
| Skewness | 0 |
| Sum | 3149736 |
| Variance | 37977984 |
| Value | Count | Frequency (%) | |
| 1533 | 1 | 0.3% | |
| 18453 | 1 | 0.3% | |
| 2757 | 1 | 0.3% | |
| 10893 | 1 | 0.3% | |
| 8805 | 1 | 0.3% | |
| 11901 | 1 | 0.3% | |
| 2253 | 1 | 0.3% | |
| 9957 | 1 | 0.3% | |
| 19605 | 1 | 0.3% | |
| 8301 | 1 | 0.3% | |
| Other values (286) | 286 | 96.6% |
| Value | Count | Frequency (%) | |
| 21 | 1 | 0.3% | |
| 93 | 1 | 0.3% | |
| 165 | 1 | 0.3% | |
| 237 | 1 | 0.3% | |
| 309 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 21261 | 1 | 0.3% | |
| 21189 | 1 | 0.3% | |
| 21117 | 1 | 0.3% | |
| 21045 | 1 | 0.3% | |
| 20973 | 1 | 0.3% |
| Distinct count | 296 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| Minimum | 2019-06-05 00:00:00 |
|---|---|
| Maximum | 2020-03-26 00:00:00 |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 30 |
|---|
| Value | Count | Frequency (%) | |
| 30 | 296 | 100.0% |
Length
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 2 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 2 | 100.0% |
| Distinct count | 108 |
|---|---|
| Unique (%) | 54.3% |
| Missing | 97 |
| Missing (%) | 32.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1130.286432160804 |
|---|---|
| Minimum | 39.0 |
| Maximum | 2275.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 39 |
|---|---|
| 5-th percentile | 488.4 |
| Q1 | 840 |
| median | 1111 |
| Q3 | 1409 |
| 95-th percentile | 1848.3 |
| Maximum | 2275 |
| Range | 2236 |
| Interquartile range (IQR) | 569 |
Descriptive statistics
| Standard deviation | 412.5649791 |
|---|---|
| Coefficient of variation (CV) | 0.365009229 |
| Kurtosis | 0.197372151 |
| Mean | 1130.286432 |
| Median Absolute Deviation (MAD) | 328.6583167 |
| Skewness | 0.03699693094 |
| Sum | 224927 |
| Variance | 170209.862 |
| Value | Count | Frequency (%) | |
| 1046 | 7 | 2.4% | |
| 982 | 6 | 2.0% | |
| 1201 | 6 | 2.0% | |
| 762 | 6 | 2.0% | |
| 749 | 5 | 1.7% | |
| 1447 | 5 | 1.7% | |
| 1356 | 4 | 1.4% | |
| 891 | 4 | 1.4% | |
| 1020 | 4 | 1.4% | |
| 1343 | 4 | 1.4% | |
| Other values (98) | 148 | 50.0% | |
| (Missing) | 97 | 32.8% |
| Value | Count | Frequency (%) | |
| 39 | 1 | 0.3% | |
| 64 | 1 | 0.3% | |
| 129 | 1 | 0.3% | |
| 211 | 1 | 0.3% | |
| 232 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 2275 | 1 | 0.3% | |
| 2261 | 1 | 0.3% | |
| 2157 | 1 | 0.3% | |
| 1989 | 1 | 0.3% | |
| 1951 | 1 | 0.3% |
| Distinct count | 80 |
|---|---|
| Unique (%) | 34.0% |
| Missing | 61 |
| Missing (%) | 20.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 585.795744680851 |
|---|---|
| Minimum | 137.0 |
| Maximum | 1938.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 137 |
|---|---|
| 5-th percentile | 262 |
| Q1 | 432 |
| median | 551 |
| Q3 | 698 |
| 95-th percentile | 957 |
| Maximum | 1938 |
| Range | 1801 |
| Interquartile range (IQR) | 266 |
Descriptive statistics
| Standard deviation | 254.7286657 |
|---|---|
| Coefficient of variation (CV) | 0.4348421238 |
| Kurtosis | 6.080875428 |
| Mean | 585.7957447 |
| Median Absolute Deviation (MAD) | 181.3127026 |
| Skewness | 1.780464488 |
| Sum | 137662 |
| Variance | 64886.69314 |
| Value | Count | Frequency (%) | |
| 511 | 9 | 3.0% | |
| 600 | 7 | 2.4% | |
| 492 | 7 | 2.4% | |
| 590 | 7 | 2.4% | |
| 442 | 6 | 2.0% | |
| 314 | 6 | 2.0% | |
| 649 | 6 | 2.0% | |
| 354 | 6 | 2.0% | |
| 432 | 6 | 2.0% | |
| 698 | 6 | 2.0% | |
| Other values (70) | 169 | 57.1% | |
| (Missing) | 61 | 20.6% |
| Value | Count | Frequency (%) | |
| 137 | 1 | 0.3% | |
| 147 | 1 | 0.3% | |
| 157 | 1 | 0.3% | |
| 216 | 1 | 0.3% | |
| 246 | 4 | 1.4% |
| Value | Count | Frequency (%) | |
| 1938 | 1 | 0.3% | |
| 1741 | 1 | 0.3% | |
| 1603 | 2 | 0.7% | |
| 1407 | 1 | 0.3% | |
| 1170 | 1 | 0.3% |
| Distinct count | 203 |
|---|---|
| Unique (%) | 93.5% |
| Missing | 79 |
| Missing (%) | 26.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2845.2350230414745 |
|---|---|
| Minimum | 0.0 |
| Maximum | 17426.0 |
| Zeros | 2 |
| Zeros (%) | 0.7% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 337.8 |
| Q1 | 1172 |
| median | 2322 |
| Q3 | 3609 |
| 95-th percentile | 7185.4 |
| Maximum | 17426 |
| Range | 17426 |
| Interquartile range (IQR) | 2437 |
Descriptive statistics
| Standard deviation | 2424.726318 |
|---|---|
| Coefficient of variation (CV) | 0.852205986 |
| Kurtosis | 7.871113381 |
| Mean | 2845.235023 |
| Median Absolute Deviation (MAD) | 1700.164497 |
| Skewness | 2.210779419 |
| Sum | 617416 |
| Variance | 5879297.718 |
| Value | Count | Frequency (%) | |
| 2322 | 2 | 0.7% | |
| 289 | 2 | 0.7% | |
| 1050 | 2 | 0.7% | |
| 123 | 2 | 0.7% | |
| 2987 | 2 | 0.7% | |
| 2084 | 2 | 0.7% | |
| 1613 | 2 | 0.7% | |
| 712 | 2 | 0.7% | |
| 2078 | 2 | 0.7% | |
| 3589 | 2 | 0.7% | |
| Other values (193) | 197 | 66.6% | |
| (Missing) | 79 | 26.7% |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.7% | |
| 62 | 1 | 0.3% | |
| 98 | 1 | 0.3% | |
| 123 | 2 | 0.7% | |
| 169 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 17426 | 1 | 0.3% | |
| 13665 | 1 | 0.3% | |
| 11310 | 1 | 0.3% | |
| 10044 | 1 | 0.3% | |
| 10028 | 1 | 0.3% |
| Distinct count | 1 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 |
|---|
| Value | Count | Frequency (%) | |
| 0 | 296 | 100.0% |
festivo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 8 |
| Value | Count | Frequency (%) | |
| 0 | 288 | 97.3% | |
| 1 | 8 | 2.7% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.9966216216216215 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.997453142 |
|---|---|
| Coefficient of variation (CV) | 0.6665683542 |
| Kurtosis | -1.241520413 |
| Mean | 2.996621622 |
| Median Absolute Deviation (MAD) | 1.706560446 |
| Skewness | 0.004680305814 |
| Sum | 887 |
| Variance | 3.989819056 |
| Value | Count | Frequency (%) | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 0 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 2 | 43 | 14.5% | |
| 3 | 43 | 14.5% | |
| 4 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 6 | 42 | 14.2% | |
| 5 | 42 | 14.2% | |
| 4 | 42 | 14.2% | |
| 3 | 43 | 14.5% | |
| 2 | 43 | 14.5% |
| Distinct count | 4 |
|---|---|
| Unique (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 4 | |
|---|---|
| 3 | |
| 1 | |
| 2 |
| Value | Count | Frequency (%) | |
| 4 | 92 | 31.1% | |
| 3 | 92 | 31.1% | |
| 1 | 86 | 29.1% | |
| 2 | 26 | 8.8% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 10 |
|---|---|
| Unique (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.993243243243243 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.667533456 |
|---|---|
| Coefficient of variation (CV) | 0.5244395666 |
| Kurtosis | -1.215710455 |
| Mean | 6.993243243 |
| Median Absolute Deviation (MAD) | 3.109751644 |
| Skewness | -0.3478227975 |
| Sum | 2070 |
| Variance | 13.45080165 |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 10 | 31 | 10.5% | |
| 8 | 31 | 10.5% | |
| 7 | 31 | 10.5% | |
| 1 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 9 | 30 | 10.1% | |
| 2 | 29 | 9.8% | |
| 6 | 26 | 8.8% | |
| 3 | 26 | 8.8% |
| Value | Count | Frequency (%) | |
| 1 | 31 | 10.5% | |
| 2 | 29 | 9.8% | |
| 3 | 26 | 8.8% | |
| 6 | 26 | 8.8% | |
| 7 | 31 | 10.5% |
| Value | Count | Frequency (%) | |
| 12 | 31 | 10.5% | |
| 11 | 30 | 10.1% | |
| 10 | 31 | 10.5% | |
| 9 | 30 | 10.1% | |
| 8 | 31 | 10.5% |
| Distinct count | 43 |
|---|---|
| Unique (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.469594594594593 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 31 |
| Q3 | 42 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 15.97664889 |
|---|---|
| Coefficient of variation (CV) | 0.561182873 |
| Kurtosis | -1.229228509 |
| Mean | 28.46959459 |
| Median Absolute Deviation (MAD) | 13.65613587 |
| Skewness | -0.3266565044 |
| Sum | 8427 |
| Variance | 255.2533097 |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 29 | 7 | 2.4% | |
| 28 | 7 | 2.4% | |
| 27 | 7 | 2.4% | |
| 26 | 7 | 2.4% | |
| 25 | 7 | 2.4% | |
| 24 | 7 | 2.4% | |
| 12 | 7 | 2.4% | |
| 11 | 7 | 2.4% | |
| Other values (33) | 226 | 76.4% |
| Value | Count | Frequency (%) | |
| 1 | 7 | 2.4% | |
| 2 | 7 | 2.4% | |
| 3 | 7 | 2.4% | |
| 4 | 7 | 2.4% | |
| 5 | 7 | 2.4% |
| Value | Count | Frequency (%) | |
| 52 | 7 | 2.4% | |
| 51 | 7 | 2.4% | |
| 50 | 7 | 2.4% | |
| 49 | 7 | 2.4% | |
| 48 | 7 | 2.4% |
working_day
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 424.0 B |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 246 | 83.1% | |
| False | 50 | 16.9% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.004759498821957385 |
|---|---|
| Minimum | -0.9749279121818236 |
| Maximum | 0.9749279121818236 |
| Zeros | 42 |
| Zeros (%) | 14.2% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9749279122 |
|---|---|
| 5-th percentile | -0.9749279122 |
| Q1 | -0.7818314825 |
| median | 0 |
| Q3 | 0.7818314825 |
| 95-th percentile | 0.9749279122 |
| Maximum | 0.9749279122 |
| Range | 1.949855824 |
| Interquartile range (IQR) | 1.563662965 |
Descriptive statistics
| Standard deviation | 0.7086201304 |
|---|---|
| Coefficient of variation (CV) | 148.8854514 |
| Kurtosis | -1.50521649 |
| Mean | 0.004759498822 |
| Median Absolute Deviation (MAD) | 0.6270716718 |
| Skewness | -0.0106157593 |
| Sum | 1.408811651 |
| Variance | 0.5021424891 |
| Value | Count | Frequency (%) | |
| 0.4338837391 | 43 | 14.5% | |
| 0.9749279122 | 43 | 14.5% | |
| -0.4338837391 | 42 | 14.2% | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| 0.7818314825 | 42 | 14.2% | |
| 0 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9749279122 | 42 | 14.2% | |
| -0.7818314825 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% | |
| 0 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 0.9749279122 | 43 | 14.5% | |
| 0.7818314825 | 42 | 14.2% | |
| 0.4338837391 | 43 | 14.5% | |
| 0 | 42 | 14.2% | |
| -0.4338837391 | 42 | 14.2% |
cos_weekday
Real number (ℝ)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0037955736549281846 |
|---|---|
| Minimum | -0.9009688679024191 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | -0.9009688679 |
|---|---|
| 5-th percentile | -0.9009688679 |
| Q1 | -0.9009688679 |
| median | -0.222520934 |
| Q3 | 0.6234898019 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1.900968868 |
| Interquartile range (IQR) | 1.52445867 |
Descriptive statistics
| Standard deviation | 0.7079619739 |
|---|---|
| Coefficient of variation (CV) | -186.5230498 |
| Kurtosis | -1.503349059 |
| Mean | -0.003795573655 |
| Median Absolute Deviation (MAD) | 0.6408877408 |
| Skewness | 0.009053080122 |
| Sum | -1.123489802 |
| Variance | 0.5012101565 |
| Value | Count | Frequency (%) | |
| -0.222520934 | 43 | 14.5% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.9009688679 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| -0.9009688679 | 42 | 14.2% | |
| -0.9009688679 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| 0.6234898019 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| 0.6234898019 | 42 | 14.2% | |
| -0.222520934 | 43 | 14.5% | |
| -0.222520934 | 42 | 14.2% |
is_august
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 31 |
| Value | Count | Frequency (%) | |
| 0 | 265 | 89.5% | |
| 1 | 31 | 10.5% |
spring
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 | 5 |
| Value | Count | Frequency (%) | |
| 0 | 291 | 98.3% | |
| 1 | 5 | 1.7% |
summer
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 188 | 63.5% | |
| 1 | 108 | 36.5% |
autumn
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 206 | 69.6% | |
| 1 | 90 | 30.4% |
winter
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 200 | 67.6% | |
| 1 | 96 | 32.4% |
stockMissingType
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 15 |
| Value | Count | Frequency (%) | |
| 0 | 199 | 67.2% | |
| 2 | 82 | 27.7% | |
| 1 | 15 | 5.1% |
Length
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 3 | 75.0% | |
| Other_Punctuation | 1 | 25.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
| Distinct count | 235 |
|---|---|
| Unique (%) | 95.5% |
| Missing | 50 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 573.0759146341463 |
|---|---|
| Minimum | 190.2 |
| Maximum | 1293.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 190.2 |
|---|---|
| 5-th percentile | 301.3125 |
| Q1 | 460.4642857 |
| median | 551.1875 |
| Q3 | 693.28125 |
| 95-th percentile | 875.75 |
| Maximum | 1293.25 |
| Range | 1103.05 |
| Interquartile range (IQR) | 232.8169643 |
Descriptive statistics
| Standard deviation | 172.1052204 |
|---|---|
| Coefficient of variation (CV) | 0.3003183627 |
| Kurtosis | 0.5630146335 |
| Mean | 573.0759146 |
| Median Absolute Deviation (MAD) | 137.5237316 |
| Skewness | 0.4154764392 |
| Sum | 140976.675 |
| Variance | 29620.20689 |
| Value | Count | Frequency (%) | |
| 518.5 | 3 | 1.0% | |
| 606 | 2 | 0.7% | |
| 709.125 | 2 | 0.7% | |
| 700.625 | 2 | 0.7% | |
| 680.75 | 2 | 0.7% | |
| 511 | 2 | 0.7% | |
| 479.25 | 2 | 0.7% | |
| 719.125 | 2 | 0.7% | |
| 539.5 | 2 | 0.7% | |
| 640.375 | 2 | 0.7% | |
| Other values (225) | 225 | 76.0% | |
| (Missing) | 50 | 16.9% |
| Value | Count | Frequency (%) | |
| 190.2 | 1 | 0.3% | |
| 207.4285714 | 1 | 0.3% | |
| 211.125 | 1 | 0.3% | |
| 228.25 | 1 | 0.3% | |
| 231.6 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 1293.25 | 1 | 0.3% | |
| 983.375 | 1 | 0.3% | |
| 977.375 | 1 | 0.3% | |
| 944 | 1 | 0.3% | |
| 936.75 | 1 | 0.3% |
| Distinct count | 6 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 42 |
| Missing (%) | 14.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 585.5999817451901 |
|---|---|
| Minimum | 397.2368421052632 |
| Maximum | 811.8717948717949 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 397.2368421 |
|---|---|
| 5-th percentile | 397.2368421 |
| Q1 | 461.575 |
| median | 568.155761 |
| Q3 | 704.375 |
| 95-th percentile | 811.8717949 |
| Maximum | 811.8717949 |
| Range | 414.6349528 |
| Interquartile range (IQR) | 242.8 |
Descriptive statistics
| Standard deviation | 139.513103 |
|---|---|
| Coefficient of variation (CV) | 0.2382395959 |
| Kurtosis | -1.097016964 |
| Mean | 585.5999817 |
| Median Absolute Deviation (MAD) | 115.0452121 |
| Skewness | 0.2891297061 |
| Sum | 148742.3954 |
| Variance | 19463.90591 |
| Value | Count | Frequency (%) | |
| 704.375 | 43 | 14.5% | |
| 560.4736842 | 43 | 14.5% | |
| 575.8378378 | 42 | 14.2% | |
| 461.575 | 42 | 14.2% | |
| 397.2368421 | 42 | 14.2% | |
| 811.8717949 | 42 | 14.2% | |
| (Missing) | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 397.2368421 | 42 | 14.2% | |
| 461.575 | 42 | 14.2% | |
| 560.4736842 | 43 | 14.5% | |
| 575.8378378 | 42 | 14.2% | |
| 704.375 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 811.8717949 | 42 | 14.2% | |
| 704.375 | 43 | 14.5% | |
| 575.8378378 | 42 | 14.2% | |
| 560.4736842 | 43 | 14.5% | |
| 461.575 | 42 | 14.2% |
| Distinct count | 248 |
|---|---|
| Unique (%) | 89.2% |
| Missing | 18 |
| Missing (%) | 6.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1158.1083119218913 |
|---|---|
| Minimum | 234.0 |
| Maximum | 2261.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 234 |
|---|---|
| 5-th percentile | 696.1571429 |
| Q1 | 892.55 |
| median | 1135.071429 |
| Q3 | 1385.0875 |
| 95-th percentile | 1740.7125 |
| Maximum | 2261 |
| Range | 2027 |
| Interquartile range (IQR) | 492.5375 |
Descriptive statistics
| Standard deviation | 352.7980329 |
|---|---|
| Coefficient of variation (CV) | 0.3046330203 |
| Kurtosis | 0.2689832869 |
| Mean | 1158.108312 |
| Median Absolute Deviation (MAD) | 283.7650596 |
| Skewness | 0.3389982677 |
| Sum | 321954.1107 |
| Variance | 124466.452 |
| Value | Count | Frequency (%) | |
| 1524 | 6 | 2.0% | |
| 1240 | 4 | 1.4% | |
| 1040 | 3 | 1.0% | |
| 930 | 3 | 1.0% | |
| 749 | 2 | 0.7% | |
| 1149.857143 | 2 | 0.7% | |
| 1130 | 2 | 0.7% | |
| 1628 | 2 | 0.7% | |
| 234 | 2 | 0.7% | |
| 1796 | 2 | 0.7% | |
| Other values (238) | 250 | 84.5% | |
| (Missing) | 18 | 6.1% |
| Value | Count | Frequency (%) | |
| 234 | 2 | 0.7% | |
| 240.8571429 | 1 | 0.3% | |
| 395.4 | 1 | 0.3% | |
| 465 | 1 | 0.3% | |
| 474 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 2261 | 1 | 0.3% | |
| 2193 | 1 | 0.3% | |
| 2157 | 1 | 0.3% | |
| 2125 | 1 | 0.3% | |
| 2057 | 1 | 0.3% |
meanwd_udsstock
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1130.710683285964 |
|---|---|
| Minimum | 801.4516129032259 |
| Maximum | 1432.423076923077 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 801.4516129 |
|---|---|
| 5-th percentile | 801.4516129 |
| Q1 | 955.2857143 |
| median | 1055.258065 |
| Q3 | 1378.633333 |
| 95-th percentile | 1432.423077 |
| Maximum | 1432.423077 |
| Range | 630.971464 |
| Interquartile range (IQR) | 423.347619 |
Descriptive statistics
| Standard deviation | 222.9857939 |
|---|---|
| Coefficient of variation (CV) | 0.1972085319 |
| Kurtosis | -1.491811678 |
| Mean | 1130.710683 |
| Median Absolute Deviation (MAD) | 206.2324389 |
| Skewness | 0.04542798926 |
| Sum | 334690.3623 |
| Variance | 49722.66427 |
| Value | Count | Frequency (%) | |
| 1378.633333 | 43 | 14.5% | |
| 1055.258065 | 43 | 14.5% | |
| 1432.423077 | 42 | 14.2% | |
| 801.4516129 | 42 | 14.2% | |
| 985.92 | 42 | 14.2% | |
| 1301.896552 | 42 | 14.2% | |
| 955.2857143 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 801.4516129 | 42 | 14.2% | |
| 955.2857143 | 42 | 14.2% | |
| 985.92 | 42 | 14.2% | |
| 1055.258065 | 43 | 14.5% | |
| 1301.896552 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 1432.423077 | 42 | 14.2% | |
| 1378.633333 | 43 | 14.5% | |
| 1301.896552 | 42 | 14.2% | |
| 1055.258065 | 43 | 14.5% | |
| 985.92 | 42 | 14.2% |
| Distinct count | 230 |
|---|---|
| Unique (%) | 97.5% |
| Missing | 60 |
| Missing (%) | 20.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2862.405054479419 |
|---|---|
| Minimum | 0.0 |
| Maximum | 17426.0 |
| Zeros | 5 |
| Zeros (%) | 1.7% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 196.75 |
| Q1 | 1270.642857 |
| median | 2367.875 |
| Q3 | 3471.46875 |
| 95-th percentile | 7589.09375 |
| Maximum | 17426 |
| Range | 17426 |
| Interquartile range (IQR) | 2200.825893 |
Descriptive statistics
| Standard deviation | 2497.345349 |
|---|---|
| Coefficient of variation (CV) | 0.8724639951 |
| Kurtosis | 7.670456713 |
| Mean | 2862.405054 |
| Median Absolute Deviation (MAD) | 1702.768456 |
| Skewness | 2.268021193 |
| Sum | 675527.5929 |
| Variance | 6236733.794 |
| Value | Count | Frequency (%) | |
| 0 | 5 | 1.7% | |
| 206 | 2 | 0.7% | |
| 123 | 2 | 0.7% | |
| 2832.875 | 1 | 0.3% | |
| 2623 | 1 | 0.3% | |
| 1371.125 | 1 | 0.3% | |
| 13794.5 | 1 | 0.3% | |
| 951.25 | 1 | 0.3% | |
| 1868.25 | 1 | 0.3% | |
| 2588.125 | 1 | 0.3% | |
| Other values (220) | 220 | 74.3% | |
| (Missing) | 60 | 20.3% |
| Value | Count | Frequency (%) | |
| 0 | 5 | 1.7% | |
| 62 | 1 | 0.3% | |
| 98 | 1 | 0.3% | |
| 111.25 | 1 | 0.3% | |
| 123 | 2 | 0.7% |
| Value | Count | Frequency (%) | |
| 17426 | 1 | 0.3% | |
| 13794.5 | 1 | 0.3% | |
| 13665 | 1 | 0.3% | |
| 11310 | 1 | 0.3% | |
| 11131.75 | 1 | 0.3% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2301.6070133419885 |
|---|---|
| Minimum | 206.0 |
| Maximum | 4904.307692307692 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.4 KiB |
Quantile statistics
| Minimum | 206 |
|---|---|
| 5-th percentile | 206 |
| Q1 | 692.7692308 |
| median | 2219.675676 |
| Q3 | 3566.384615 |
| 95-th percentile | 4904.307692 |
| Maximum | 4904.307692 |
| Range | 4698.307692 |
| Interquartile range (IQR) | 2873.615385 |
Descriptive statistics
| Standard deviation | 1496.422306 |
|---|---|
| Coefficient of variation (CV) | 0.6501641232 |
| Kurtosis | -0.8705206805 |
| Mean | 2301.607013 |
| Median Absolute Deviation (MAD) | 1204.94844 |
| Skewness | 0.2685388148 |
| Sum | 681275.6759 |
| Variance | 2239279.717 |
| Value | Count | Frequency (%) | |
| 2641.921053 | 43 | 14.5% | |
| 3566.384615 | 43 | 14.5% | |
| 4904.307692 | 42 | 14.2% | |
| 2219.675676 | 42 | 14.2% | |
| 692.7692308 | 42 | 14.2% | |
| 1841.974359 | 42 | 14.2% | |
| 206 | 42 | 14.2% |
| Value | Count | Frequency (%) | |
| 206 | 42 | 14.2% | |
| 692.7692308 | 42 | 14.2% | |
| 1841.974359 | 42 | 14.2% | |
| 2219.675676 | 42 | 14.2% | |
| 2641.921053 | 43 | 14.5% |
| Value | Count | Frequency (%) | |
| 4904.307692 | 42 | 14.2% | |
| 3566.384615 | 43 | 14.5% | |
| 2641.921053 | 43 | 14.5% | |
| 2219.675676 | 42 | 14.2% | |
| 1841.974359 | 42 | 14.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 21 | 2019-06-05 | 30 | 1266.0 | 738.0 | 11310.0 | 0.0 | 0.0 | 2 | 2 | 6 | 23 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 738.00 | 560.473684 | 1266.0 | 1055.258065 | 11310.00 | 2641.921053 |
| 1 | 93 | 2019-06-06 | 30 | NaN | 944.0 | 17426.0 | 0.0 | 0.0 | 3 | 2 | 6 | 23 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 944.00 | 704.375000 | NaN | 1378.633333 | 17426.00 | 3566.384615 |
| 2 | 165 | 2019-06-07 | 30 | NaN | 836.0 | 13665.0 | 0.0 | 0.0 | 4 | 2 | 6 | 23 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 2.0 | 836.00 | 811.871795 | NaN | 1301.896552 | 13665.00 | 4904.307692 |
| 3 | 237 | 2019-06-08 | 30 | NaN | 295.0 | 2876.0 | 0.0 | 0.0 | 5 | 2 | 6 | 23 | True | -0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 2.0 | 295.00 | 397.236842 | NaN | 1432.423077 | 2876.00 | 692.769231 |
| 4 | 309 | 2019-06-09 | 30 | NaN | NaN | NaN | 0.0 | 0.0 | 6 | 2 | 6 | 23 | False | -0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 2.0 | NaN | NaN | NaN | 955.285714 | NaN | 206.000000 |
| 5 | 381 | 2019-06-10 | 30 | NaN | 511.0 | 5371.0 | 0.0 | 0.0 | 0 | 2 | 6 | 24 | True | 0.000000 | 1.000000 | 0 | 0 | 1 | 0 | 0 | 2.0 | 511.00 | 575.837838 | NaN | 985.920000 | 5371.00 | 2219.675676 |
| 6 | 453 | 2019-06-11 | 30 | 849.0 | 541.0 | 3684.0 | 0.0 | 0.0 | 1 | 2 | 6 | 24 | True | 0.781831 | 0.623490 | 0 | 0 | 1 | 0 | 0 | 0.0 | 541.00 | 461.575000 | 849.0 | 801.451613 | 3684.00 | 1841.974359 |
| 7 | 525 | 2019-06-12 | 30 | 1508.0 | 492.0 | 1661.0 | 0.0 | 0.0 | 2 | 2 | 6 | 24 | True | 0.974928 | -0.222521 | 0 | 0 | 1 | 0 | 0 | 0.0 | 676.50 | 560.473684 | 1326.5 | 1055.258065 | 8897.75 | 2641.921053 |
| 8 | 597 | 2019-06-13 | 30 | 1938.0 | 698.0 | 2900.0 | 0.0 | 0.0 | 3 | 2 | 6 | 24 | True | 0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 882.50 | 704.375000 | 1938.0 | 1378.633333 | 13794.50 | 3566.384615 |
| 9 | 669 | 2019-06-14 | 30 | 1356.0 | 1033.0 | 3532.0 | 0.0 | 0.0 | 4 | 2 | 6 | 24 | True | -0.433884 | -0.900969 | 0 | 0 | 1 | 0 | 0 | 0.0 | 885.25 | 811.871795 | 1356.0 | 1301.896552 | 11131.75 | 4904.307692 |
Last rows
| df_index | fecha | producto | udsstock | udsventa | udsprevisionempresa | promo | festivo | weekday | quarter | month | weekofyear | working_day | sin_weekday | cos_weekday | is_august | spring | summer | autumn | winter | stockMissingType | roll4wd_udsventa | meanwd_udsventa | roll4wd_udsstock | meanwd_udsstock | roll4wd_udsprevisionempresa | meanwd_udsprevisionempresa | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 286 | 20613 | 2020-03-17 | 30 | 39.0 | 1741.0 | 1375.0 | 0.0 | 0.0 | 1 | 1 | 3 | 12 | True | 0.781831 | 0.623490 | 0 | 0 | 0 | 0 | 1 | 0.0 | 545.750000 | 461.575000 | 395.400000 | 801.451613 | 2486.000 | 1841.974359 |
| 287 | 20685 | 2020-03-18 | 30 | NaN | 551.0 | 2265.0 | 0.0 | 0.0 | 2 | 1 | 3 | 12 | True | 0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 2.0 | 508.500000 | 560.473684 | 465.000000 | 1055.258065 | 3413.625 | 2641.921053 |
| 288 | 20757 | 2020-03-19 | 30 | 762.0 | 1407.0 | 3526.0 | 0.0 | 0.0 | 3 | 1 | 3 | 12 | True | 0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 621.750000 | 704.375000 | 1153.000000 | 1378.633333 | 4447.250 | 3566.384615 |
| 289 | 20829 | 2020-03-20 | 30 | 1511.0 | 1938.0 | 3892.0 | 0.0 | 0.0 | 4 | 1 | 3 | 12 | True | -0.433884 | -0.900969 | 0 | 0 | 0 | 0 | 1 | 0.0 | 1293.250000 | 811.871795 | 1295.000000 | 1301.896552 | 8129.125 | 4904.307692 |
| 290 | 20901 | 2020-03-21 | 30 | 2275.0 | 246.0 | NaN | 0.0 | 0.0 | 5 | 1 | 3 | 12 | True | -0.974928 | -0.222521 | 0 | 0 | 0 | 0 | 1 | 0.0 | 765.625000 | 397.236842 | 1254.500000 | 1432.423077 | NaN | 692.769231 |
| 291 | 20973 | 2020-03-22 | 30 | 1410.0 | NaN | NaN | 0.0 | 0.0 | 6 | 1 | 3 | 12 | False | -0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | NaN | NaN | 1410.000000 | 955.285714 | NaN | 206.000000 |
| 292 | 21045 | 2020-03-23 | 30 | 1410.0 | NaN | 473.0 | 0.0 | 0.0 | 0 | 1 | 3 | 13 | True | 0.000000 | 1.000000 | 0 | 1 | 0 | 0 | 1 | 0.0 | 579.857143 | 575.837838 | 1410.000000 | 985.920000 | 2009.125 | 2219.675676 |
| 293 | 21117 | 2020-03-24 | 30 | 936.0 | NaN | 440.0 | 0.0 | 0.0 | 1 | 1 | 3 | 13 | True | 0.781831 | 0.623490 | 0 | 1 | 0 | 0 | 1 | 0.0 | 935.714286 | 461.575000 | 240.857143 | 801.451613 | 1839.875 | 1841.974359 |
| 294 | 21189 | 2020-03-25 | 30 | 1004.0 | NaN | 1153.0 | 0.0 | 0.0 | 2 | 1 | 3 | 13 | True | 0.974928 | -0.222521 | 0 | 1 | 0 | 0 | 1 | 0.0 | 542.142857 | 560.473684 | 644.000000 | 1055.258065 | 2661.000 | 2641.921053 |
| 295 | 21261 | 2020-03-26 | 30 | 1515.0 | NaN | 797.0 | 0.0 | 0.0 | 3 | 1 | 3 | 13 | True | 0.433884 | -0.900969 | 0 | 1 | 0 | 0 | 1 | 0.0 | 819.000000 | 704.375000 | 950.250000 | 1378.633333 | 3599.875 | 3566.384615 |